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CFO 14073 US 



IMAGE PROCESSING APPARATUS 

BACKGROUND OF THE INVENTION 
Field of the Invention 

The invention relates to image processing 
apparatus, method, and system and a storage medium, in 
which a copyright can be protected* 
Related Background Art 

Hitherto, a VRML (Virtual Reality Markup Language) 
is widely and generally used as a language to describe 
a 3D (three dimension) scene. In a system using such a 
language, an arbitrary object is arranged in a 3D 
space, a sight point, a light source, a texture map, 
and the like are set to thereby construct a scene, and 
a virtual space with high reality can be formed by 
adding data such as video/audio data to each object. 

In ISO/IEC 14494-1 (MPEG-4 Systems), on the basis 
of the foregoing VRML, data to describe the scene is 
reduced and a 3D scene similar to that mentioned above 
is described by using a BIFS (Binary Format for Scene 
Description) obtained by binary expression - table 
converting the VRML, . The binarized BIFS data is 
called a BIFS stream. 

Although a detailed binarizing method is not 
mentioned here, in case of such a BIFS stream, 
different from a text such as a VRML, it is necessary 
to reconstruct a scene structure after once decoding 



the BIFS stream on the display side. 

In case of using a texture, video/audio data, or 
the like, those bit streams are also simultaneously 
multiplexed and transmitted and received as a single 
bit stream. 

Fig. 1 shows an example of a conventional 
receiving and displaying system of 3D data. 

In the diagram, reference numeral 101 denotes a 
bit stream receiving unit for receiving a bit stream 
from a line. 

Reference numeral 102 denotes a demultiplexer for 
extracting each bit stream from the single multiplexed 
bit stream. 

Reference numeral 103 denotes a BIFS decoder (BIFS 
parser) for decoding scene information to be displayed 
and forming a scene tree of a 3D object. "Scene tree" 
denotes information showing layout information of the 
objects, a mutual dependency relationship, and the 
like. Reference numeral 104 denotes an image decoder 
and shows a portion for decoding compressed image code 
data such as a JPEG file or the like. 

Reference numeral 105 denotes a video decoder for 
decoding code data of video, and 106 indicates an audio 
decoder for decoding code data of audio. 

Reference numeral 107 denotes a scene tree memory 
for storing the scene tree formed by the BIFS decoder 
103. 
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Reference numeral 108 denotes a renderer which 
finally arranges a 3D object: and a texture and 
video/audio data which are associated with the 3D 
object into a 3D space and displays and reproduces them 
5 on the basis of the scene tree stored in the scene tree 
memory 107. 

Reference numeral 109 denotes a final output 
device. For example, image information is displayed on 
a TV monitor and audio information is reproduced from a 
10 speaker. 

The bit stream is separated, decoded, and rendered 
as mentioned above and 3D displayed. 

Fig. 2 shows an example of such a kind of bit 
stream. 

15 Reference numeral 201 denotes a header/ info stream 

in which a header portion and multiplexed information 
of each stream are written. Reference numeral 202 
denotes a BIFS stream in which scene information is 
described; 203 an image data stream to which texture 

20 data or the like is transmitted; and 204 to 209 

video/audio streams in which a video stream and an 
audio stream are alternately multiplexed. In media 
such as video, audio, and the like which need a real- 
time reproduction and a synchronization, the video 

25 stream and the audio stream are often alternately 
multiplexed . 

Fig. 3 shows an example of the scene tree formed 
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by the BIFS decoder 103. However, various field data 
is omitted here. 

It will be understood that an image texture is 
adhered to a 3D object box from the scene tree shown in 
5 Fig, 3, a movie texture is adhered to a 3D object 

cylinder, and further, an audio data is reproduced. 

Fig, 4 shows a display example in the case where 
an image, video data, and audio data are rendered on 
the basis of the scene tree shown in Fig. 3. 
10 It will be understood from Fig. 4 that a 3D object 

box 401 to which an image texture has been adhered and 
a 3D object cylinder 402 to which a movie texture has 
been adhered are displayed and, at the same time, an 
audio (audio sound or audio data) 403 is reproduced. 
15 It will be obviously understood that not only the 

still image texture can be mapped but also an audio 
clip and a video clip can be mapped by the foregoing 
VRML as mentioned above. 

In recent years, there is a tendency of adopting a 
20 technique to protect a copyright with respect to the 
display of such a 3D scene. 

Specifically speaking, a method whereby a stream 
of copyright information is inserted into a bit stream, 
thereby protecting data such as texture image, 
25 video/audio data, or the like on a stream (media 
stream) unit basis is considered. 

According to such a method, the copyright 
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information is previously multiplexed into the bit 
stream. By using the method, the stream such as 
video/audio data is protected by the copyright 
information. Only in the case where the stream is 
5 authenticated by descrambling or collating it with a 
password or the like, the copyright protection is 
cancelled and the display and reproduction of 
video/audio data are started. Not only the video/audio 
streams but also a BIFS stream can be similarly 

10 protected as one media stream. 

If such a method is used, however, since the 3D 
object is not defined as a stream, a problem such that 
the 3D object itself cannot be protected occurs. 

It is now assumed as an example that a movie 

15 texture on the 3D object cylinder 402 and the audio 403 
shown in Fig. 4 are protected. 

In this case, after the rendering, as shown at 
reference numerals 405 and 404, while the movie texture 
on the 3D object cylinder 402 and the audio 403 are 

20 protected, they are not displayed and reproduced 
obviously. However, the shape of the 3D object 
cylinder 402 is displayed as it is in a gray color 
which has been set as a color of a default as shown in 
Fig. 5. 

25 If the user wants to set such that the 3D object 

cylinder is not displayed, since the 3D object has been 
defined by the BIFS stream, the BIFS stream itself has 



- 6 - 



to be protected. 

In such a case, however, the 3D object box itself 
is not displayed neither in a manner similar to the 3D 
object cylinder at this time, 
5 It is, therefore, considered to previously divide 

the BIFS stream every 3D object and protect only the 
stream which defines the 3D object cylinder. However, 
it is not easy to divide the BIFS stream and each time 
the 3D object is moved, modified, extinguished, or 

10 newly appears, the BIFS stream corresponding thereto 
has to be updated any time or the like, so that a 
problem such that processes become complicated occurs. 
In case of using the VRML, it is also considered 
to form a VRML file corresponding to each 3D object and 

15 describe the whole 3D scene so as to individually 

recognize each of a plurality of 3D objects. In this 
case, however, a problem such that the VRML file has to 
be complicatedly formed occurs. 

20 SUMMARY OF THE INVENTION 

In consideration of the above problems, it is an 
object of the invention to provide image processing 
apparatus, method, and system and a storage medium, in 
which a copyright with respect to an arbitrary 3D 

25 object can be extremely simply and easily protected 
without performing a troublesome process such that a 
stream of BIFS is divided into a plurality of streams. 
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To accomplish the above object, according to a 
preferred embodiment of the invention, there is 
disclosed an image processing apparatus for displaying 
a three-dimensional scene, comprising identifying means 
5 for identifying a 3 -dimensional object having 

copyright-protected information among 3 -dimensional 
objects constructing the 3-dimensional scene on the 
basis of data describing the 3-dimensional scene; and 
display inhibiting means for inhibiting a display of 
10 the 3-dimensional object identified by the identifying 
means until a predetermined authenticating process is 
finished. There are also disclosed an information 
processing method for such an information processing 
apparatus and a storage medium which stores a program 
15 to realize such an information processing method. 

To accomplish the above object, according to 
another preferred embodiment of the invention, there is 
disclosed an image processing system comprising a 
transmitting apparatus and a receiving apparatus, 
20 wherein the transmitting apparatus includes 

transmitting means for transmitting scene data 
describing a 3-dimensional scene, media data associated 
with the scene data, and copyright-protected data, and 
the receiving apparatus includes receiving means for 
25 receiving the scene data describing the 3-dimensional 
scene, media data associated with the scene data, and 
copyright-protected data which were transmitted from 



the transmitting apparatus, separating means for 
separating all of the data received by the receiving 
means, access control means for controlling accesses to 
the scene data and the media data which were separated 
by the separating means on the basis of the copyright- 
protected data separated by the separating means, media 
decoding means for decoding the media data separated by 
the separating means, scene decoding means for forming 
copyright-protected scene data and copyright- 
unprotected scene data from the scene data separated by 
the separating means on the basis of the copyright- 
protected data separated by the separating means, and 
rendering means for rendering the 3-dimensional scene 
on the basis of the media data decoded by the media 
decoding means and the copyright -protected scene data 
and copyright-unprotected scene data formed by the 
scene decoding means. 

The above and other objects and features of the 
present invention will become apparent from the 
following detailed description and the appended claims 
with reference to the accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a constructional diagram of a 3D 
reproducing system; 

Fig. 2 shows an example of a construction of a bit 
stream which is processed in the 3D system of Fig. 1; 
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Fig, 3 is a diagram showing an example of a scene 

tree; 

Fig. 4 is a diagram showing an example of a 
rendering result; 
5 Fig. 5 is a diagram showing an example of a 

rendering result of a scene whose copyright has been 
protected; 

Fig. 6 is a constructional diagram of a 3D 
reproducing system according to the first embodiment; 
10 Fig. 7 is a diagram showing an example of a bit 

stream whose copyright has been protected; 

Fig. 8 is a diagram showing divided scene trees; 

Fig. 9 is a diagram showing an example of a 
rendering result of a scene whose copyright has been 
15 protected according to the first embodiment; 

Fig. 10 is a constructional diagram of a 3D 
reproducing system according to the second embodiment; 

Fig. 11 is a timing chart for a 3D reproducing 
process according to the second embodiment; and 
20 Fig. 12 is a diagram showing an example of a 3D 

description by a VRML according to the third 
embodiment . 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
25 Fi <J- 6 shows an example of a receiving and 

displaying system of 3D data according to the first 
embodiment of the invention. 
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In -the diagram, reference numeral 601 denotes a 
bit stream receiving unit for receiving a bit stream 
from a line. # 

The bit stream receiving unit 601 is not always 
5 limited to a receiving unit in communication but can be 
a receiving unit for receiving a bit stream obtained by 
reading out data from a recording media or the like* 

Reference numeral 602 denotes a demultiplexer for 
extracting each bit stream from a single multiplexed 
10 bit stream* 

Reference numeral 603 denotes an IPMP 
(Intellectual Properly Management and Protection) 
manager for controlling an access control of a stream 
controller 604, which will be explained hereinlater, in 
15 accordance with copyright information extracted by the 
demultiplexer 602. 

Reference numeral 604 denotes the stream 
controller for transmitting a media stream ( stream such 
as image, video, audio, or the like) to subsequent 
20 media decoders such as BIFS decoder 605, image decoder 
606, video decoder 607, audio decoder 608 only in the 
case where the authentication is normally performed by 
the IPMP manager 603 . 

When the media stream itself is protected by 
25 enciphering or the like, the stream controller 604 

properly decodes an encryption by the control of the 
IPMP manager 603 and, thereafter, transmits a bit 
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stream to the media decoder corresponding to each media 
stream. 

Reference numeral 605 denotes the BIFS decoder 
(B1FS parser) for decoding scene information to be 
5 displayed, divide a scene into a protection node and an 
unprotection node (node in which the display can be 
performed as it is), and forms two scene trees of a 
protected scene tree and an unprotected scene tree. 

Reference numeral 606 denotes the image decoder 

10 and shows a portion for decoding a compressed image 
code data such as a JPEG file. 

Reference numeral 607 denotes the video decoder 
for decoding video code data, and 608 indicates the 
audio decoder for decoding audio code data. 

15 Reference numeral 609 denotes an unprotected scene 

tree memory for storing the unprotected scene tree 
formed by the BIFS decoder 605, and 610 indicates a 
protected scene tree memory for storing the protected 
scene tree formed by the BIFS decoder 605. 

20 Reference numeral 611 denotes a render er for 

finally arranging a 3D object and a texture and 
video/audio data which are associated with the 3D 
object into a 3D space and displaying and reproducing 
them on the basis of the scene trees stored in the 

25 unprotected scene tree memory 609 and protected scene 
tree memory 610. 

The data belonging to the unprotected scene tree 
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±s unconditionally rendered. The data belonging to the 
protected scene tree is rendered after the copyright 
information is cancelled and a tree structure is 
reconstructed * 
5 Reference numeral 612 denotes a scene parent 

memory for storing scene parent information, which will 
be explained hereinlater. 

Reference numeral 613 denotes a final output 
device* For example, an image is displayed on the TV 
10 monitor and an audio sound is reproduced from the 
speaker . 

Fig, 7 shows an example of a bit stream according 
to the first embodiment of the invention. 

Reference numeral 701 denotes a header/info stream 

15 to which a header portion and multiplex information of 
each stream are written. Reference numeral 702 denotes 
an IPMP stream in which copyright information is 
described and 704 indicates a BIFS stream in which 
scene information is described. 

20 Reference numeral 705 denotes an image data stream 

in which texture data or the like is transmitted. 

Further, reference numerals 706 to 711 denote 
video/audio streams in which a video stream and an 
audio stream are alternately multiplexed. 

25 Hatched portions of the video/audio streams 706 to 

711 denote that they are protected by the copyright 
information of the IPMP stream 702. 
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That is, as for the video/audio streams 706 to 
711, only when they are authenticated by descrambling, 
password collation, or the like, the copyright 
protection is cancelled and the display and 
reproduction of the video/audio data are started. 

Fig. 8 shows examples of the unprotected scene 
tree and protected scene tree formed by the B1FS 
decoder 605 . 

Even in the first embodiment of the invention, it 
is assumed that the movie texture on the 3D object 
cylinder 402 and the audio 403 in Fig. 4 are protected 
by the copyright information. 

In Fig. 8, therefore, a box node in which the 
image texture has been mapped is formed as an 
unprotected scene tree 801. On the contrary, a 
cylinder node in which the movie texture has been 
mapped and a node of audio mapped to the whole scene is 
formed as a protected scene tree 802. 

Since node IDs ( = 1 to 9 ) are allocated to the 
nodes, respectively, and ROOT (root of the scene) which 
is defined by ID = 0 is the unprotection node, even if 
the copyright protection is not cancelled, the scene 
can be constructed only by the unprotected scene tree 
801. 

Since the ROOT defined by ID = 0 does not exist in 
the protected scene tree 802, a scene parent 
information showing to which position in the scene each 
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node belonging to the protected scene tree 802 is 
connected is stored in the scene parent memory 612. 
Specifically speaking, the scene parent information 
which is stored in the scene parent memory 612 
comprises a set of a node ID to be linked and an ID of 
its parent node. 

In Fig. 8, a set of ID = 5 and ID = 1 and a set of 
ID = 8 and ID = 0 (ROOT) are stored in the scene parent 
memory 612. In this case, although only one child node 
is linked with respect to each parent node, a plurality 
of child nodes can obviously exist. 

Although the details of an internal construction 
of the scene parent memory 612 are not described here, 
for example, a method such that a child node ID is 
written subsequently to the parent node ID and the node 
ID is terminated by a unique code which does not 
overlap to the ID number is considered. 

When the copyright protection is not cancelled 
here, since the scene is constructed only by the 
unprotected scene tree 801, it is displayed as shown in 
Fig. 9. 

As will be obviously understood from Fig. 9, while 
the movie texture on the 3D object cylinder 402 and the 
audio 403 are protected, they are not displayed nor 
reproduced and the shape of the 3D object cylinder 402 
is not at all displayed as well. 

When the copyright protection is cancelled, since 
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the scene is constructed by both the unprotected scene 
tree 801 and protected scene tree 802, it is displayed 
as shown in Fig. 9. 

Specifically speaking, when the scene is 
5 reconstructed, the scene parent information is read out 
from the scene parent memory 612 and a shape node 
defined by ID = 5 is linked as a child node of a 
transform node defined by ID = 1, thereby displaying 
the 3D object cylinder 402 having the movie texture. 

10 On the other hand, a sound node defined by ID = 8 is 
linked as a child node of ROOT defined by ID = 0, 
thereby reproducing the audio 403 . 

According to the first embodiment as described 
above, by forming the two scene trees of the protected 

15 scene tree and the unprotected scene tree on the basis 
of the protection node and the unprotection node 
included in the BIFS stream, the copyright protection 
of the 3D object and the media associated therewith can 
be easily performed. 

20 Although the first embodiment can be realized by 

hardware, the whole system can be obviously realized by 
software. 

Fig. 10 shows an example of a receiving/displaying 
system of 3D data according to the second embodiment. 
25 In the second embodiment, besides the construction 

of the first embodiment shown in Fig. 6, a release 
timing controller 1001 is added. 
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A case where the movie texture is adhered to the 
3D object cylinder and a copyright of the scene is 
protected in a manner similar to the first embodiment 
will now be presumed. 
5 When the copyright protection is cancelled by 

obtaining the authentication, the display of the 3D 
object cylinder and the movie texture is started. In 
this case, however, if the decoding of the movie 
texture is started before the rendering of the 3D 
10 object cylinder is finished, the scene is not normally 
formed. Further, it is also necessary to synchronize 
the movie texture and the audio again. 

In the second embodiment, therefore, the timing 
for rendering after the copyright protection is 
15 cancelled is adjusted by the release timing controller 
1001. 

Fig. 11 shows a control example of the release 
timing controller 1001. 

In the second embodiment, it is assumed that a 

20 copyright is not protected at the start of the display 
and both the 3D object and the video/ audio are normally 
reproduced until time tl on the halfway. The 
protection of a copyright of the 3D object is started 
at time tl . Since the protection is cancelled at time 

25 t2, a period of time between time tl and t2 corresponds 
to an IPMP operation time of the 3D object. Similarly, 
a period of time between time t3 and t4 corresponds to 
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a time necessary for processes of IPMP of video. 

In such a state, by setting a final undisplaying 
period of time to a period between time tl and t4, the 
release timing controller 1001 performs a control so as 
5 not to cause an inconvenience in the synthesis of the 
scene . 

Fig. 12 shows an example of description of a 3D 
scene in case of the third embodiment in which the 
technique realized in the system according to the first 
10 embodiment is applied to the VRML. 

Explanation will now be made in detail hereinbelow 
while tracing the lines of the description of the 3D 
scene . 

The description regarding points which are not 
15 concerned with the present invention is omitted 

although they are necessary to explain the VRML. The 
line number is written at the line end of each line. 

The first line relates to a node to group the 
objects . 

20 In the 2nd to 7th lines, parameters such as layout 

position, angle of rotation, and the like of the 
objects are set. 

In the 8th and 9th lines, the kind of figure is 
defined. In this example, a box is arranged. A box 

25 node has parameters of lateral, vertical, and height as 
a field (showing attributes which are peculiar to the 
node). In this case, they are set to a value of "1". 
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In the 10th to 12th lines, a surface shape 
(texture) of a box is defined. In the 13th line, 
"Texturel.jpg" (JPEG file) is shown as a name of the 
file of image texture which is actually texture mapped. 

In the 19th and subsequent lines, similarly, a 
cylinder is arranged at a position different from that 
of the box and "Texture2 . mpg" (MPEG file) is mapped as 
a surface shape (texture). In this case, since the 
video is designated as a source of the texture, it is 
called a movie texture and a motion image is reproduced 
on the cylinder. 

In the (24-l)th to (24-4)th lines, a new node 
"protect" is used. This node is a kind of group node 
(which is used when several nodes are handled in a 
lump) and has a url (Uniform Resource Locator) field. 
Although the cylinder node is linked to "IPMPl.dat", it 
shows a link of the cylinder node to the copyright 
information. This "protect" node is nothing but one 
description example and another expression can be also 
used. 

In the (35-l)th and (35-2)th lines, the "protect" 
node is also used in a manner similar to the case 
mentioned above and the audio node is linked to 
"IPMPl.dat" here. 

In the 36th to 39th lines, an audio source is 
defined and 11 Sound. mpg" (MPEG audio file) is 
simultaneously reproduced as a sample when the scene is 



displayed. 

When -the information processing apparatus 
reproduces the 3D scene on the basis of the VRML, the 
information processing apparatus executes the following 
processes. 

That is, first, the VRML is read and the "protect" 
node is detected. Subsequently, when the "protect" 
node is detected, the rendering of the portion grouped 
by the "protect" node is temporarily stopped. When it 
is determined that the inhibition can be cancelled due 
to the authenticating process of a copyright, the 
rendering of the portion grouped by the "protect" node 
is performed. 

When the protection of a copyright is not 
cancelled, since the rendering of the portion grouped 
by the "protect" node is inhibited, it is displayed as 
shown in Fig. 9. When the protection of a copyright is 
cancelled, since the rendering of the portion grouped 
by the "protect" node is also performed, it is 
displayed as shown in Fig. 4. 

Although both the cylinder node and the audio node 
have the same copyright information in the third 
embodiment, the cylinder node and the audio node can 
also have different copyright information by allowing 
the cylinder node to link to "IPMPl.dat" and allowing 
the audio node to link to "lPMP2.dat". 

According to the third embodiment as described 
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above, by adding the protection node such as "protect" 
node to the VRML, the copyright protection of the 3D 
object and the media associated therewith can be easily 
performed. 

5 As described above, a copyright of the 3D object 

and the texture and video/audio which are associated 
with the 3D object and the like can be integratedly and 
extremely easily controlled. 

Many widely different embodiments of the present 

10 invention may be constructed without departing from the 
spirit and scope of the present invention. It should 
be understood that the present invention is not limited 
to the specific embodiments described in the 
specification, except as defined in the appended 

15 claims. 
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WHAT IS CLAIMED IS: 

1. An image processing apparatus for displaying a 
3 -dimensional scene, comprising: 

(A) identifying means for identifying a 3- 
dimensional object having copyright-protected 
information among 3-dimensional objects constructing 
said 3-dimensional scene on the basis of data 
describing said 3-dimensional scene; and 

(B) display inhibiting means for inhibiting a 
display of the 3-dimensional object identified by said 
identifying means until a predetermined authenticating 
process is finished. 

2. An apparatus according to claim 1, further 
comprising reproduction inhibiting means for inhibiting 
a reproduction of video/audio in the case where said 3- 
dimensional object whose display is inhibited by said 
display inhibiting means is accompanied with the 
video/audio. 

3. An apparatus according to claim 2, further 
comprising synchronizing means for, in the case where 
said 3-dimensional object whose display is inhibited by 
said display inhibiting means is accompanied with the 
video/audio, synchronizing the display of said 3- 
dimensional object with the reproduction of said 
video/audio when the inhibition of the display by said 
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display inhibiting means is cancelled* 

4. An image processing apparatus for displaying a 
3-dimensional scene, comprising: 

(A) identifying means for identifying a 3- 
dimensional object having copyright-protected 
information among 3-dimensional objects constructing 
said 3-dimensional scene on the basis of data 
describing said 3-dimensional scene; and 

(B) classifying means for classifying the 3- 
dimensional object identified by said identifying means 
to a first group and classifying the other 3- 
dimensional objects to a second group; and 

(C) display control means for controlling the 
display of said 3-dimensional scene on the basis of the 
group classified by said classifying means. 

5. An apparatus according to claim &, wherein said 
classifying means further classifies the 3-dimensional 
object identified by said identifying means and 
video/audio associated with said 3-dimensional object 
to the first group and classifies the other 3- 
dimensional objects and video/audio associated with 
said other 3-dimensional objects to the second group. 

6. An image processing apparatus comprising: 
(A) receiving means for receiving scene data 
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describing a 3 -dimensional scene, media data associated 
with said scene data, and copyright-protected data; 

(B) separating means for separating all of the data 
received by said receiving means; 

(C) access control means for controlling accesses to 
the scene data and the media data separated by said 
separating means on the basis of the copyright- 
protected data separated by said separating means; 

(D) media decoding means for decoding the media data 
separated by said separating means; 

(E) scene decoding means for forming copyright- 
protected scene data and copyright-unprotected scene 
data from the scene data separated by said separating 
means on the basis of the copyright-protected data 
separated by said separating means; and 

(F) rendering means for rendering the 3-dimensional 
scene on the basis of the media data decoded by said 
media decoding means and the copyright -protected scene 
data and the copyright -unprotected scene data formed by 
said scene decoding means, 

7. An apparatus according to claim 6, wherein said 
copyright -protected scene data describes a scene which 
is rendered after authentication, and said copyright- 
unprotected scene data describes a scene which is 
rendered irrespective of the authentication. 
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8. An apparatus according to claim 6, further 
comprising instructing means for giving an instruction 
for an access timing in said access control means in 
order to adjust a timing for the rendering by said 
rendering means, 

9. An image processing apparatus comprising: 
detecting means for detecting a copyright 

protection node from a language describing a 3- 
dimensional scene ; 

identifying means for identifying a 3-dimensional 
object designated by the copyright protection node 
detected by said detecting means; and 

display inhibiting means for inhibiting a display 
of the 3-dimensional object identified by said 
identifying means until a predetermined authenticating 
process is finished. 

10. An apparatus according to claim 9, wherein said 
language is a VRML. 

11. An image processing method of displaying a 3- 
dimensional scene, comprising: 

(A) an identifying step of identifying a 3- 
dimensional object having copyright-protected 
information among 3-dimensional objects constructing 
said 3-dimensional scene on the basis of data 



- 25 - 



describing said 3-dimensional scene; and 

(B) a display inhibiting step of inhibiting a 
display of the 3-dimensional object identified in said 
identifying step until a predetermined authenticating 
5 process is finished. 

12. A method according to claim 11, further 
comprising a reproduction inhibiting step of inhibiting 
a reproduction of video/audio in the case where said 3- 

10 dimensional object whose display is inhibited in said 
display inhibiting step is accompanied with the 
video/audio . 

13. A method according to claim 12 r further 

15 comprising a synchronizing step of , in the case where 

said 3-dimensional object whose display is inhibited in 
said display inhibiting step is accompanied with the 
video/audio, synchronizing the display of said 3- 
dimensional object with the reproduction of said 

20 video/audio when the inhibition of the display in said 
display inhibiting step is cancelled. 

14. An image processing method of displaying a 3- 
dimensional scene, comprising: 

25 (A) an identifying step of identifying a 3- 

dimensional object having copyright-protected 
information among 3-dimensional objects constructing 
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said 3 -dimensional scene on the basis of data 
describing said 3 -dimensional scene; and 

(B) a classifying step of classifying the 3- 
dimensional object identified in said identifying step 
to a first group and classifying the other 3- 
dimensional objects to a second group; and 

(C) a display control step of controlling the 
display of said 3-dimensional scene on the basis of the 
group classified in said classifying step. 

15. A method according to claim 14, wherein in said 
classifying step, the 3-dimensional object identified 
in said identifying step and video/audio associated 
with said 3-dimensional object are classified to the 
first group, and the other 3-dimensional objects and 
video/audio associated with said other 3-dimensional 
objects are classified to the second group. 

16. An image processing method comprising: 

(A) a receiving step of receiving scene data 
describing a 3-dimensional scene, media data associated 
with said scene data, and copyright-protected data; 

(B) a separating step of separating all of the data 
received in said receiving step; 

(C) an access control step of controlling accesses 
to the scene data and the media data separated in said 
separating step on the basis of the copyright-protected 
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data separated in said separating step; 

(D) a media decoding step of decoding the media data 
separated in said separating step; 

(E) a scene decoding step of forming copyright - 

5 protected scene data and copyright-unprotected scene 
data from the scene data separated in said separating 
step on the basis of the copyright-protected data 
separated in said separating step; and 

(F) a rendering step of rendering the 3-dimensional 
10 scene on the basis of the media data decoded in said 

media decoding step and the copyright-protected scene 
data and the copyright-unprotected scene data formed in 
said scene decoding step, 

15 17. A method according to claim 16, wherein said 

copyright-protected scene data describes a scene which 
is rendered after authentication, and said copyright- 
unprotected scene data describes a scene which is 
rendered irrespective of the authentication, 

20 

18. A method according to claim 16, further 
comprising an instructing step of giving an instruction 
for an access timing in said access control step in 
order to adjust a timing for the rendering in said 
25 rendering step. 



19. An image processing method comprising: 
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(A) a detec-ting step of detecting a copyright 
protection node from a language describing a 3- 
dimensional scene; 

(B) an identifying step of identifying a 3- 
dimensional object designated by the copyright 
protection node detected in said detecting step; and 

(C) a display inhibiting step of inhibiting a 
display of the 3 -dimensional object identified in said 
identifying step until a predetermined authenticating 
process is finished. 

20. A method according to claim 19, wherein said 
language is a VRML. 

21. An image processing system comprising a 
transmitting apparatus and a receiving apparatus, 
wherein 

(A) said transmitting apparatus includes 
transmitting means for transmitting scene data 

describing a 3-dimensional scene, media data associated 
with said scene data, and copyright-protected data, and 

(B) said receiving apparatus includes: 
receiving means for receiving the scene data 

describing the 3-dimensional scene, the media data 
associated with said scene data, and the copyright- 
protected data which were transmitted from said 
transmitting apparatus ; 
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separating means for separating all of the data 
received by said receiving means; 

access control means for controlling accesses to 
the scene data and the media data separated by said 
separating means on the basis of the copyright- 
protected data separated by said separating means; 

media decoding means for decoding the media data 
separated by said separating means; 

scene decoding means for forming copyright- 
protected scene data and copyright-unprotected scene 
data from the scene data separated by said separating 
means on the basis of the copyright-protected data 
separated by said separating means; and 

rendering means for rendering the 3-dimensional 
scene on the basis of the media data decoded by said 
media decoding means and the copyright -protected scene 
data and the copyright-unprotected scene data formed by 
said scene decoding means. 

22. A storage medium which stores a computer 
program, said computer program comprising: 

(A) an identifying module for identifying a 3- 
dimensional object having copyright-protected 
information among 3-dimensional objects constructing a 
3-dimensional scene on the basis of data describing 
said 3-dimensional scene; and 

(B) a display inhibiting module for inhibiting a 
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display of the 3 -dimensional object identified by the 
identifying process by said identifying module until a 
predetermined authenticating process is finished. 
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ABSTRACT OF THE DISCLOSURE 

An image processing apparatus is constructed by: a 
receiving unit for receiving scene data describing a 3- 
dimensional scene, media data associated with the scene 
5 data, and copyright -protected data; a scene decoder for 
forming copyright-protected scene data and copyright- 
unprotected scene data from the scene data on the basis 
of the copyright-protected data; and a renderer for 
rendering the 3-dimensional scene on the basis of the 
10 media data, the copyright-protected scene data, and the 
copyright-unprotected scene data. 
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Transform { 1 
translation -0.926123 1.49168 -0.0233803 2 
rotation 0 10 0 3 
children { 4 
Transform { 5 
translation 0 0 -0.976271 6 
children [ 7 
Shape { 8 
geometry Box { 1 1 1 } 9 

appearance Appearance { 10 

material Material {} 11 

texture ImageTexture { 12 

url ai Texture1.jpg ai 13 

} 14 

} 15 

} 16 

] 17 

} 18 

Transform { 19 

translation 2.26053 -0.0278533 0.867797 20 

rotation 0 1 0 0 21 

children [ 22 

Shape { 23 

Protect { 24-1 

geometry Cylinder {} 24-2 

url ai IPMP1.dat ai 24-3 

} 24-4 

appearance Appearance { 25 

material Material {} 26 

texture MovieTexture { 27 

url ai Texture2.mpg ai 28 

} 29 

} 30 

} 31 

] 32 

} 33 

] 34 

} 35 

Protect { 35-1 

url ai IPMP1.dat ai 35-2 

Sound { 36 

source AudioClip { 37 

url ai Sound.mpg ai 38 

} 39 

} 40 
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